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Filed: November 29, 2000 

Title: Dynamically Configurable Processor 

Attorney Docket No.: 872.0025.USU 
Customer No.: 29,683 

Commissioner For Patents 

P.O. Box 1450 
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This is a request for a Certificate of Correction (MPEP 1480) in regard to the above- 
identified patent. Attached is a Form PTO-1050. The mistakes, incurred through the fault of 
the Patent and Trademark Office, are clearly disclosed by the records of the Office as 
indicated by the following description: 

In Claim 1 : Column 8, line 39, after "sources" please add the following language: 
--, wherein said plurality of ALUs, wherein the second mode of operation, operate in parallel 
with one another one data received from said second data sources, and wherein said data 
processor forms a part of a wireless terminal--. See Claim 1 of Examiner's Amendment 
dated May 24, 2005. 

In Claim 8 : Column 9, line 8, after "sources" please add the following language: 

--, wherein said plurality of ALUs, wherein the second mode of operation, operate in parallel 

with one another one data received from said second data sources, and wherein said data 
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processor forms a part of a wireless terminal--. See Claim 10 of Examiner's Amendment 
dated May 24, 2005. 



In Claim 17 : 

Column 10, line 22 please delete "ALU" and replace with -ALUs--. 
Column 10, line 23, after "n-bits" please add the following language: 

-, wherein said DSP forms a part of a wireless terminal — . See Claim 21 of Examiner's 
Amendment dated May 24, 2005. 

The Office is requested to issue a Certificate of Correction. 
Respectfully submitted, 



Robert J. Mauri (Reg. No. 41 ,1 80) Date 

Customer No.: 29683 

Harrington & Smith, LLP 
4 Research Drive 
Shelton, CT 06484-6212 
203-925-9400 



I hereby certify that this correspondence is being deposited with the United States Postal 
Service as first class mail on the date shown below in an envelope addressed to: 
Commissioner For Patents, P.O. Box 1450, Alexandria, VA 22313-1450. 
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UNITED STATES PATENT AND TRADEMARK OFFICE 

CERTIFICATE OF CORRECTION 

PATENT NO. : 6,959,316 82 
DATED : October 25, 2005 

INVENTOR(S) : Ja" A. Parviainen 

It is certified that an error appears or errors appear in the above-identified patent and that said 
Letters Patent is hereby corrected as shown below: 



Claim 1 : Column 8, line 39, after "sources" please add the following language: 
--, wherein said plurality of ALUs, wherein the second mode of operation, operate in 
parallel with one another one data received from said second data sources, and wherein 
said data processor forms a part of a wireless terminal--. 

Claim 8 : Column 9, line 8, after "sources" please add the following language: 
--, wherein said plurality of ALUs, wherein the second mode of operation, operate in 
parallel with one another one data received from said second data sources, and wherein 
said data processor forms a part of a wireless terminal--. 

Claim 17 : 

Column 10, line 22 please delete "ALU" and replace with -ALUs-. 
Column 10, line 23, after "n-bits" please add the following language: 
--, wherein said DSP forms a part of a wireless terminal --. 
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In accordance with the teachings herein the final 32-bit It is assumed that the plurality of additional ALUs 14B 

addition unit 36 can be implemented as two, 16-bit ALUs, made possible by the use of this invention are exploited by 

which are then available as well for other processing tasks suitable programming tools and instructions, such as by the 

besides multiplication. In addition, techniques exist for use of C++ callable intrinsics and/or intelligent compiler 

reducing the number of remaining addition units (see, for 5 design. The teachings of this invention enable improvements 

example, Peter Pirsch: "Architectures for Digital Signal in the operation of certain existing instruction types, as well 

Processing"). One possible approach is use the Carry /Save/ as an ability to define hew instruction types. These teachings 

Addition principle, where it is possible to calculate together are furthermore applicable to a wide variety of DSP archi- 

three input operands and forward the carry signal to the next lecture types, including but not limited to superscalar archi- 

stage(s). This method enables one to optimize the number of 10 tectures and very long instruction word (VLIW) architec- 

addition units. Other techniques exist for implementing the tures. The teachings of this invention can be seen to provide 

multiplier, as well as for optimizing the required hardware. a "coarse-grained" programmability alternative to the rela- 

In practice the multiplied values of A and B can have the tively fixed DSP architectures and the much finer-grained 

same sign or different signs. One practical method for programmability inherent in the use of FPGAs. The teach- 

implementing multiplies is shown in FIG. 10A. The values 15 ings of this invention also avoid the problems inherent in 

of A and B are complemented in A and B complement logic ASIC-based designs, as circuit changes are made rapidly 

units 40A and 40B, respectively, is negative. The outputs of under program instruction control, and can provide a more 

complement logic units 40A and 40B feed the multiplier cost-effective solution than many FPGA designs. The use of 

front end 42, which then always operates on positive values. this invention also enables various enhancements and opti- 

The multiplier front end feeds a partial sum adder unit 44, 20 mizations to be made to the multiplier front end, so as to 

which is followed by complement end result logic 46 which accommodate and work with the use of the ALU(s) at the 

sets the sign of the result correctly depending on the signs of inputs and outputs of the multiplier front end. 

A and B. Thus, while the invention has been particularly shown and 

FIG. 10B depicts a technique for processing negative- described with respect to preferred embodiments thereof, it 

valued operands in accordance with the teachings herein. In 25 will be understood by those skilled in the art that changes in 

FIG. 10B it can be seen that ALUs 50A and SOB replace the form and details may be made therein without departing 

complement logic units 40A and 40B, and are used to from the scope and spirit of the invention, 

perform the complement operation during multiplication, as What is claimed is: 

well as to check the end result and change the sign if 1. A data processor, comprising a multiplier block having 

necessary. The ALUs 50A and SOB can be used for other 30 a multiplier front end for generating partial products from 

purposes at other times, such as for addition, subtraction, input operands, and a plurality of arithmetic logic units 

logical operations (for example, AND, OR, XOR, NOT) and (ALUs) having inputs switchably coupled, in a first mode of 

for shifting. The ALUs 50A and SOB may also be used, as operation, to first data sources comprised of outputs of said 

described above, to implement all or part of the adder block multiplier front end for adding together partial products 

44. 35 received therefrom to arrive at a multiplication result, said 

While described in the context of the programmably inputs of said plurality of ALUs being switchably coupled, 

configurable multiplier block 14, it can be appreciated that in a second mode of operation, to second data sources for 

other enhancements can be made as well to the DSP core 10, performing at least one of arithmetic and logical operations 

such as the use of chainable registers to achieve wider on data received from said second data sources ». CL 

register widths, splittable or combinable ALUs and the like. 40 2. A data processor as in claim 1, wherein said partial 

These enhancement techniques can also be used within the products have a width of n-bits, and where a width of said 

multiplier block 14 itself, in addition to providing the ALUs ALUs is one of n-bits or less than n-bits. 

14B in place of the conventional dedicated adder circuitry of 3. A data processor as in claim 1, wherein said partial 

the prior art (see FIG. 4). By example, FIG. 8 shows one products have a width of 8-bits, and where a width of said 

suitable technique for achieving width scaling with two 45 ALUs is one of 8-bits or 4-bits. 

ALUs, wherein a multiplexer 20 operating under control of 4. A data processor as in claim 1, wherein said partial 

a Select input, or a simple gate such as an AND gate, is used products have a width of 16-bits, and where a width of said 

for chaining together the two ALUs by selectively coupling ALUs is one of 16-bits, 8-bits or 4-bits. 

the carry out terminal of ALU 1 to the carry in terminal of 5. A data processor as in claim 1, wherein said partial 

ALU 2. 50 products have a width of 32-bits, and where a width of said 

The use of this invention provides a number of advantages ALUs is one of 32-bits, 16-bits, 8-bits or 4-bits. 

in the wireless terminal application, as different algorithms 6. A data processor as in claim 1, wherein said partial 

have different signal processing needs. For example, the products have a width of n-bits, where a width of said ALUs 

Viterbi algorithm does not necessarily require 16-bit preci- is less than n-bits, and where at least some of said plurality 

sion, as 8-bit precision may suffice. The use of this invention 55 of ALUs are switchably coupled together to provide an n-bit 

enables changes in precision in a programmable manner, and wide ALU. 

further enables the parallelism of the DSP core 10 to be 7. A data processor as in claim 1, wherein said inputs of 

changed in a dynamic manner. As an example, multiplica- said ALUs are switchably coupled under control of a pro- 

tion operations are required when performing metric calcu- gram instruction. 

lations for the Viterbi algorithm, however for detecting the 60 8. A method of operating a data processor, comprising: 

correct path through the trellis add/compare/select opera- providing a multiplier block having a multiplier front end 

tions are required. The use of the teachings of. this invention for generating partial products from input operations; 

enables the DSP core to be programmably configured to and 

perform the necessary multiplications, and to then be providing said multiplier block with a plurality of arith- 

dynamically reconfigured on-the-fly to increase the number 65 metic logic units (ALUs); wherein 

and parallelism of ALUs in order to perform the add/ in a first mode of operation, said plurality of ALUs have 

compare/select operations. • inputs switchably coupled to first data sources com- 
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prised of outputs of said multiplier front end for adding 
together partial products received therefrom to arrive at 
a multiplication result; and 
in a second mode of operation, said inputs of said plurality 
of ALUs are switchably coupled to second data sources 
for performing at least one of arithmetic and logical 
operations on data received from said second data 

9. A method as in claim 8, wherein said partial products 
have a width of n-bits, and where a width of said ALUs is 
one of n-bits or less than n-bits. 

10. A method as in claim 8, wherein said partial products 
have a width of 8-bits, and where a width of said ALUs is 
one of 8-bits or 4-bits. 

11. A method as in claim 8, wherein said partial products 
have a width of 16-bits, and where a width of said ALUs is 
one of 16-bits, 8-bits or 4-bits. 

12. A method as in claim 8, wherein said partial products 
have a width of 32-bits, and where a width of said ALUs is 
one of 32-bits, 16-bits, 8-bits or 4-bits. 

13. A method is in claim 8, wherein said partial products 
have a width of n-bits, where a width of said ALUs is less 
than n-bits, and further comprising switchably coupling 
together at least some of said plurality of ALUs to provide 
an n-bit wide ALU. 

14. A method as in claim 8, wherein said inputs of said 
ALUs are switchably coupled under control of a program 
instruction. 

15. A method as in claim 8, wherein said plurality of 
ALUs comprise the same or additional ALUs coupled to 
inputs of said multiplier front end for changing a sign of said 
input operands. 
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16. A method as in claim 8, wherein switchably coupling 
employs reconfigurable signal routing logic. 

17. A digital signal processor (DSP), comprising a DSP 
core having a register file, at least one arithmetic logical unit 

5 (ALU), and at least one multiplier block comprised of a 
multiplier front end for generating partial products from 
input operands, said multiplier block further comprising 
circuitry for adding together said partial products, said 

10 circuitry comprising a plurality of ALUs having inputs that 
are programmably coupled, in a first mode of operation, to 
first data sources comprised of outputs of said multiplier 
front end for adding together partial products received 
therefrom to arrive at a multiplication result, said inputs of 

15 said plurality of ALUs being programmably coupled, in a 
second mode of operation, to second data sources for 
selectively operating together in parallel for performing at 
least one of arithmetic and logical operations on data 
received from said second data sources, wherein said partial 

20 products have a width of n-bits, and where a width of 
individual ones of said plurality of ALliis one of n-bits or 
less than n-bits oUd Ufyk%£ 

18. A DSP as in claim 17, wherein said plurality of ALUs 
comprise the same or additional ALUs that are coupled to 
inputs of said multiplier front end for changing a sign of said 
input operands.' 

19. A DSP as in claim 17, and further comprising recon- 
figurable signal routing logic for providing data paths to and 

30 from said plurality of ALUs. 
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